The Annals of Statistics 

2011, Vol. 39, No. 4, 2205-2242 

DOI: 10.1214/11-AOS901 

© Institute of Mathematical Statistics. 2011 



STATISTICAL INFERENCE FOR TIME-CHANGED LEVY 
PROCESSES VIA COMPOSITE CHARACTERISTIC FUNCTION 

ESTIMATION 

By Denis Belomestny 

Duisburg-Essen University 

In this article, the problem of semi-parametric inference on the 
parameters of a multidimensional Levy process L t with independent 
components based on the low-frequency observations of the corre- 
sponding time-changed Levy process L-j-{t), where T is a nonnega- 
tive, nondecreasing real- valued process independent of Lt, is studied. 
We show that this problem is closely related to the problem of com- 
posite function estimation that has recently gotten much attention 
in statistical literature. Under suitable identifiability conditions, we 
propose a consistent estimate for the Levy density of Lt and derive 
the uniform as well as the pointwise convergence rates of the estimate 
proposed. Moreover, we prove that the rates obtained are optimal in 
a minimax sense over suitable classes of time-changed Levy models. 
Finally, we present a simulation study showing the performance of 
our estimation algorithm in the case of time-changed Normal Inverse 
Gaussian (NIG) Levy processes. 

1. Introduction. The problem of nonparametric statistical inference for 
jump processes or more generally for semimartingale models has long history 
and goes back to the works of Rubin and Tucker (1959) and Basawa and 
Brockwell (1982). In the past decade, one has witnessed the revival of interest 
in this topic which is mainly related to a wide availability of financial and 
economical time series data and new types of statistical issues that have 
not been addressed before. There are two major strands of recent literature 
dealing with statistical inference for semimartingale models. The first type of 
literature considers the so-called high-frequency setup, where the asymptotic 
properties of the corresponding estimates are studied under the assumption 
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that the frequency of observations tends to infinity. In the second strand of 
literature, the frequency of observations is assumed to be fixed (the so-called 
low-frequency setup) and the asymptotic analysis is done under the premiss 
that the observational horizon tends to infinity. It is clear that none of the 
above asymptotic hypothesis can be perfectly realized on real data and they 
can only serve as a convenient approximation, as in practice the frequency 
of observations and the horizon are always finite. The present paper studies 
the problem of statistical inference for a class of semimartingale models in 
low- frequency setup. 

Let X = (X t )t>o be a stochastic process valued in M. d and let T = (T(s)) s >o 
be a nonnegative, nondecreasing stochastic process not necessarily indepen- 
dent of X with T(0) = 0. A time-changed process Y = (Y s ) s >q is then defined 
as Y s = X-j-( s y The process T is usually referred to as time change. Even 
in the case of the one-dimensional Brownian motion X , the class of time- 
changed processes Xj- is very large and basically coincides with the class of 
all semimartingales [see, e.g., Monroe (1978)]. In fact, the construction in 
Monroe (1978) is not direct, meaning that the problem of specification of 
different models with the specific properties remains an important issue. For 
example, the base process X can be assumed to possess some independence 
property (e.g., X may have independent components), whereas a nonlinear 
time change can induce deviations from the independence. Along this line, 
the time change can be used to model dependence for stochastic processes. 
In this work, we restrict our attention to the case of time-changed Levy pro- 
cesses, that is, the case where X = L is a multivariate Levy process and T 
is an independent of L time change. Time-changed Levy processes are one 
step further in increasing the complexity of models in order to incorporate 
the so-called stylized features of the financial time series, like volatility clus- 
tering [for more details, see Carr et al. (2003)]. This type of processes in the 
case of the one-dimensional Brownian motion was first studied by Bochner 
(1949). Clark (1973) introduced Bochner's time-changed Brownian motion 
into financial economics: he used it to relate future price returns of cotton to 
the variations in volume during different trading periods. Recently, a number 
of parametric time-changed Levy processes have been introduced by Carr 
et al. (2003), who model the stock price St by a geometric time-changed 
Levy model 

S t = Soexp(L r(i) ), 
where L is a Levy process and T(t) is a time change of the form 

(1.1) T{t) = [ p{u)du 

Jo 

with {p(u)} u >o being a positive mean-reverting process. Carr et al. (2003) 
proposed to model p(u) via the Cox-Ingersoll-Ross (CIR) process. Taking 
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different parametric Levy models for L (such as the normal inverse Gaussian 
or the variance Gamma processes) results in a wide range of processes with 
rather rich volatility structure (depending on the rate process p) and var- 
ious distributional properties (depending on the specification of L). From 
statistical point of view, any parametric model (especially one using only 
few parameters) is prone to misspecification problems. One approach to 
deal with the misspecification issue is to adopt the general nonparametric 
models for the functional parameters of the underlying process. This may 
reduce the estimation bias resulting from an inadequate parametric model. 
In the case of time-changed Levy models, there are two natural nonpara- 
metric parameters: Levy density v, which determines the jump dynamics of 
the process L and the marginal distribution of the process T ■ 

In this paper, we study the problem of statistical inference on the char- 
acteristics of a multivariate Levy process L with independent components 
based on low-frequency observations of the time-changed process Yt = -^T(t) > 
where T(t) is a time change process independent of L with strictly station- 
ary increments. We assume that the distribution of T(t) is unknown, except 
of its mean value. This problem is rather challenging and has not been yet 
given attention in the literature, except for the special case of T(t) = t [see, 
e.g., Neumann and ReiB (2009) and Comte and Genon-Catalot (2010)]. In 
particular, the main difficulty in constructing nonparametric estimates for 
the Levy density v of L lies in the fact that the jumps are unobservable 
variables, since in practice only discrete observations of the process Y are 
available. The more frequent the observations, the more relevant informa- 
tion about the jumps of the underlying process, and hence, about the Levy 
density v are contained in the sample. Such high-frequency based statistical 
approach has played a central role in the recent literature on nonparametric 
estimation for Levy type processes. For instance, under discrete observations 
of a pure Levy process L t at times tj = jA,j = 0, . . . , n, Woerner (2003) and 
Figueroa-Lopez (2004) proposed the quantity 



where / is a given "test function." Turning back to the time-changed Levy 
processes, it was shown in Figueroa-Lopez (2009) [see also Rosenbaum and 
Tankov (2010)] that in the case, where the rate process p in (1.1) is a positive 
ergodic diffusion independent of the Levy process L, /3(f) is still a consistent 
estimator for /3(f) up to a constant, provided the time horizon nA and the 
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as a consistent estimator for the functional 
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sampling frequency A converge to infinite at suitable rates. In the case 
of low- frequency data (A is fixed), we cannot be sure to what extent the 
increment Lt k — Lt k _ 1 is due to one or several jumps or just to the diffusion 
part of the Levy process so that at first sight it may appear surprising that 
some kind of inference in this situation is possible at all. The key observation 
here is that for any bounded "test function" / 

1 n 

(1.2) -E/( L rfe) " ^Tfe_ 1 ))^E 7r [/(L r(A) )], n^oo, 
71 i=i 

provided the sequence T(tj) — T(tj-i),j = l,...,n, is stationary and er- 
godic with the invariant stationary distribution ir. The limiting expectation 
in (1.2) is then given by 

/•oo 

E n [f(L r(A) )] = / E[f(L s )Mds). 
Jo 

Taking f(z) = f u (z) = exp(iu z),u G and using the independence of L 
and T, we arrive at the following representation for the c.f. of Lf( s ) '■ 

POO 

(1.3) E[exp(m T L r(A) )]= / exp(ty(u))7r(dt) = £ A (-ip(u)), 

Jo 

where ip( u ) = t^ 1 log[Eexp(iiiLf )] is the characteristic exponent of the Levy 
process L and £ A is the Laplace transform of it. In fact, the most difficult 
part of estimation procedure comes only now and consists in reconstruct- 
ing the characteristics of the underlying Levy process L from an estimate 
for £ A (— ifi(u)). As we will see, the latter statistical problem is closely re- 
lated to the problem of composite function estimation, which is known to 
be highly nonlinear and ill-posed. The identity (1.3) also reveals the major 
difference between high-frequency and low-frequency setups. While in the 
case of high-frequency data one can directly estimate linear functionals of 
the Levy measure u, under low-frequency observations, one has to deal with 
nonlinear functionals of v rendering the underlying estimation problem non- 
linear and ill-posed. Last but not least, the increments of time-changed Levy 
processes are not any longer independent, hence advanced tools from time 
series analysis have to be used for the estimation of £ A (— ift(u)). 

The paper is organized as follows. In Section 2.1, we introduce the main 
object of our study, the time-changed Levy processes. In Section 2.2, our sta- 
tistical problem is formulated and its connection to the problem of composite 
function estimation is established. In Section 2.3, we impose some restric- 
tions on the structure of the time-changed Levy processes in order to ensure 
the identifiability and avoid the "curse of dimensionality." Section 3 contains 
the main estimation procedure. In Section 4, asymptotic properties of the es- 
timates defined in Section 3 are studied. In particular, we derive uniform and 
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pointwise rates of convergence (Sections 4.3 and 4.4, resp.) and prove their 
optimality over suitable classes of time-changed Levy models (Section 4.5). 
Section 4.7 contains some discussion. Finally, in Section 5 we present a sim- 
ulation study. The rest of the paper contains proofs of the main results and 
some auxiliary lemmas. In particular, in Section 7.3 a useful inequality on 
the probability of large deviations for empirical processes in uniform metric 
for the case of weakly dependent random variables can be found. 

2. Main setup. 

2.1. Time-changed Levy processes. Let Lt be a d-dimensional Levy pro- 
cess on the probability space (f2, J 7 , P) with the characteristic exponent ip{u), 
that is, 

iP(u) =t~ 1 logE[exp(in T L t )]. 
We know by the Levy-Khintchine formula that 

(2.1) ^{u)=ipJu- l-u T J:u+ [ (e iuTy -l-iuVl {M < 1} Mdy), 

where p € £ is a positive-semidefinite symmetric d x d matrix and v is 
a Levy measure on M rf \ {0} satisfying 

/ (\y\ 2 M)v{dy) <oo. 

jR d \{0} 

A triplet (//, £, u) is usually called a characteristic triplet of the <i-dimensional 
Levy process L t . 

Let t — > T{t) , t > be an increasing right-continuous process with left 
limits such that 7~(0) = and for each fixed t, the random variable T(t) 
is a stopping time with respect to the filtration T . Suppose furthermore 
that T(t) is finite P-a.s. for all t > and that T(t) — > oo as t — > oo. Then 
the family of (T(t))t>o defines a random time change. Now consider a d- 
dimensional process Y t := Lq-r t y The process Y t is called the time-changed 
Levy process. Let us look at some examples. If 7~(t) is a Levy process, then Y t 
would be another Levy process. A more general situation is when T(t) is 
modeled by a nondecreasing semimartingale 

T(t) = bt+ / yp(dy,ds), 
Jo Jo 

where b is a drift and p is the counting measure of jumps in the time change. 
As in Carr and Wu (2004), one can take bt = and consider locally deter- 
ministic time changes 

(2.2) T(t)= [ p(s^)ds, 

Jo 

where p is the instantaneous activity rate which is assumed to be nonnega- 
tive. When Lt is the Brownian motion and p is proportional to the instan- 
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taneous variance rate of the Brownian motion, then Yt is a pure jump Levy 
process with the Levy measure proportional to p. Let us now compute the 
characteristic function of Y%. Since T(t) and Lt are independent, we get 

(2.3) <j> Y {u\t) = E(e iuTL rw) = C t (-^(u)), 

where Ct is the Laplace transform of Tit): 

A(A) = E(e- Ar W). 

2.2. Statistical problem. In this paper, we are going to study the problem 
of estimating the characteristics of the Levy process L from low-frequency 
observations Yq, . . . , Y n/ \ of the process Y for some fixed A > 0. Moving 
to the spectral domain and taking into account (2.1), we can reformulate 
our problem as the problem of semi-parametric estimation of the charac- 
teristic exponent ip under structural assumption (2.1) from an estimate of 
4>y(u\A) based on Yo,Y&, . . . ,Y n &. The formula (2.3) shows that the func- 
tion (/)y(u\A) can be viewed as a composite function and our statistical 
problem is hence closely related to the problem of statistical inference on 
the components of a composite function. The latter type of problems in re- 
gression setup has gotten much attention recently [see, e.g., Horowitz and 
Mammen (2007) and Juditsky, Lepski and Tsybakov (2009)]. Our problem 
has, however, some features not reflected in the previous literature. First, the 
unknown link function C/s., being the Laplace transform of the r.v. 7~(A), 
is completely monotone. Second, the complex-valued function ip is of the 
form (2.1) implying, for example, a certain asymptotic behavior of ip{u) 
as u — > oo. Finally, we are not in regression setup and </>y(u|A) is to be 
estimated by its empirical counterpart 

i n 

^( U ) = i^ e - T (^->A( J - 1 ,). 

U i=i 

The contribution of this paper to the literature on composite function esti- 
mation is twofold. On the one hand, we introduce and study a new type of 
statistical problems which can be called estimation of a composite function 
under structural constraints. On the other hand, we propose new and con- 
structive estimation approach which is rather general and can be used to 
solve other open statistical problems of this type. For example, one can 
directly adapt our method to the problem of semi-parametric inference 
in distributional Archimedian copula-based models [see, e.g., McNeil and 
Neslehova (2009) for recent results], where one faces the problem of esti- 
mating a multidimensional distribution function of the form 

F(xi,. ..,x d ) = G(/i(xi) + • • • + f d {x d )), (xi, . . . ,x d ) € M d , 

with a completely monotone function G and some functions fi,-.-,fd- Fur- 
ther discussion on the problem of composite function estimation can be 
found in Remark 4.14. 
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2.3. Specification analysis. It is clear that without further restrictions on 
the class of time-changed Levy processes our problem of estimating v is not 
well defined, as even in the case of the perfectly known distribution of the 
process Y the parameters of the Levy process L are generally not identifiable. 
Moreover, the corresponding statistical procedure will suffer from the "curse 
of dimensionality" as the dimension d increases. In order to avoid these 
undesirable features, we have to impose some additional restrictions on the 
structure of the time-changed process Y. In statistical literature, one can 
basically find two types of restricted composite models: additive models and 
single-index models. While the latter class of models is too restrictive in our 
situation, the former one naturally appears if one assumes the independence 
of the components of Lf. In this paper, we study a class of time-changed 
Levy processes satisfying the following two assumptions: 

(ALI) The Levy process Lt has independent components such that at 
least two of them are nonzero, that is, 

(2.4) (t> Y {u\t) = C t (-Mui) M»d)), 

where tpk, k = 1, . . . , d, are the characteristic exponents of the components 
of Lt of the form 

(2.5) 

+ / (e mx - l-iux-ln x \< 1} )v k (dx), k = l,...,d, 
Jm. 

and 

(2.6) \m\+af+ [ x 2 vi(dx)^0 

Jr 

for at least two different indexes I. 

(ATI) The time change process T is independent of the Levy process L 
and satisfies E[T(i)] = t. 

Discussion. The advantage of the modeling framework (2.4) is twofold. 
On the one hand, models of this type are rather flexible: the distribution 
of Yt for a fixed t is in general determined by d + 1 nonparametric compo- 
nents and 2 x d parametric ones. On the other hand, these models remain 
parsimonious and, as we will see later, admit statistical inference not suffer- 
ing from the "curse of dimensionality" as d becomes large. The latter feature 
of our model is in accordance with the well documented behavior of the ad- 
ditive models in regression setting and may become particularly important 
if one is going to use it, for instance, to model large portfolios of assets. The 
nondegeneracy assumption (2.6) basically excludes one-dimensional models 
and is not restrictive since it can be always checked prior to estimation by 
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testing that 




- *A -1),Z) 2 > 



for at least two different indexes I. Let us make a few remarks on the one- 
dimensional case, where 



If £a is known, that is, the distribution of the r.v. T(A) is known, we can 
consistently estimate the Levy measure v\ by inverting £ A (see Section 4.6 
for more details). In the case when the function £ A is unknown, one needs 
some additional assumptions (e.g., absolute continuity of the time change) 
to ensure identifiability. Indeed, consider a class of the one-dimensional Levy 
processes of the so-called compound exponential type with the characteristic 
exponent of the form 



where i/j(u) is the characteristic exponent of another one-dimensional Levy 
process L t . It is well known [see, e.g., Section 3 in Chapter 4 of Steutel 
and van Harn (2004)] that exp(^(tt)) is the characteristic function of some 
infinitely divisible distribution if exp(ip(u)) does. Introduce 



As can be easily seen, the function £ A is completely monotone with £a(0) = 1 

and £ A (0) = £' A (0). Moreover, it is fulfilled £ A (-■$>(«)) = £a(— 0(«)) for all 
u £ R. The existence of the time change (increasing) process T with a given 
marginal 7~(A) can be derived from the general theory of stochastic partial 
ordering [see Kamae and Krengel (1978)]. The above construction indicates 
that the assumption E[7~(i)] =t,t>0, is not sufficient to ensure the identi- 
fiability in the case of one-dimensional time-changed Levy models. 

3. Estimation. 

3.1. Main ideas. Assume that the Levy measures of the component pro- 
cesses L\ , . . . , Lt are absolutely continuous with integrable densities v\ (x), . . . , 
Vd(%) that satisfy 



(2.7) 



(f> Y (u\t) = Ct{-Mv)) 



t>0. 




C A {z) = C A (log{l + z)). 




k = 1, . . . , d. 



Consider the functions 



v k {x) = x 2 v k {x), k = l,...,d. 
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By differentiating ip k two times, we get 

M(u) = -ol- [ j u *u k {x)dx. 



For the sake of simplicity, in the sequel we will make the following assump- 
tion: 

(ALS) The diffusion volatilities a k , k = 1, . . . ,d, of the Levy process L are 
supposed to be known. 

A way how to extend our results to the case of the unknown (a k ) is outlined 
in Section 4.6. Introduce the functions "ip k (u) = ipk(u) + af^v? /2 to get 

(3.1) F[u k ](u) = -ij'iiu) = -a 2 k - 

where F stands for the Fourier transform of v k . Denote Z = Ya, 

4> k (u) = d Uk (f> z (u),(j) k i(u) = d UkUl <j) Z (u) and <p jk i(u) = d UjUkUl <j)z(u) for j,k, 
I € {1, ... ,d} with 

(3.2) (f> z {u) = E[exp(iu T Z)] = £ A (-^i(«i) M^d))- 

Fix some k G {1, . . . , d} and for any real number u introduce a vector 

-u (fc) = (0,...,0,u,0,...,0)eIR d 

with u being placed at the kth coordinate of the vector u^ k \ Choose some 
I 7^ k, such that the component L\ is not degenerated. Then we get from (3.2) 

if \x\ / and 

_ ijj' k {u) 



(3.4) 



in the case m = 0. The identities <^(0) = -^(0)£' A (0) and 4> u (0) = [^(O)} 2 x 
£ A (0)-t/f(0)£' A (0) imply^KO) = -[£ / A (°)]" 1 ^(0) = A-Vz(0) and^'(O) = 
-[£' A (0)]-V«(0) = A" V«(0) if $(0) = 0, since £' A (0) = -E[T(A)] = -A. 
Combining this with (3.3) and (3.4), we derive 

, //, s * 1 , , N ^fc(n (fc) )0/(n( fc ))-^(n( fc ))0 /fc (u( fc )) 

(3.5) ^( U ) = A-VK0)— ^(nW) ' 

(3.6) ip k {u) = A (f>ii(0) , w = 0. 

Note that in the above derivations we have repeatedly used assumption 
(ATI), that turns out to be crucial for the identifiability. The basic idea 
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of the algorithm, we shall develop in the Section 3.2, is to estimate v k by 
an application of the regularized Fourier inversion formula to an estimate 
of 4> k (u). As indicated by formulas (3.5) and (3.6), one could, for example, 
estimate (it) , if some estimates for the functions <f> k (it) , <fiik (u) and <pn k (it) 
are available. 



Remark 3.1. One important issue we would like to comment on is 
the robustness of the characterizations (3.5) and (3.6) with respect to the 
independence assumption for the components of the Levy process Lf. First, 
note that if the components are dependent, then the key identity (3.1) is not 
any longer valid for ip'S defined as in (3.5) or (3.6). Let us determine how 
strong can it be violated. For concreteness, assume that m > and that the 
dependence in the components of Lt is due to a correlation between diffusion 
components. In particular, let £(&,/) > 0. Since in the general case 

0i(llW) 



and d UhUk i/j(u( k >) = -a\ - F[u k ](u), we get 



,(*)• 

E(jfe,Z) 



F[u k ](u)+ip'l(u)+(T k - 

Using the fact that both functions ud Uk {(j) k (u^) /(pi(u^)} and 4> k (u^)/ 
<pi(u^) are uniformly bounded for u 6 K, we get that the model "misspeci- 
fication bias" is bounded by CE(k,l) with some constant C > 0. Thus, the 
weaker is the dependence between components L k and L l , the smaller is the 
resulting "misspecification bias." 

3.2. Algorithm. Set Zj = Y^j — ^a(j-i))J = 1, . . . ,n, and denote by Z k 
the kih coordinate of Zj. Note that Zj,j = l,...,n, are identically dis- 
tributed. The estimation procedure consists basically of three steps: 

Step 1. First, we are interested in estimating partial derivatives of the 
function 4>z( u ) up to the third order. To this end, define 



1 n 

(3.7) <f> k (u) = -Y, Z jeM™ T Z j ), 

1 n 

(3.8) <h k (u) = -J2 Z k Z\ exp(ra T ^), 

71 3=1 

1 n 

(3.9) <j> Uk {u) = -J2 Z k 3 Z\z) exp(m T Z j 



n . 
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Step 2. In a second step, we estimate the second derivative of the char- 
acteristic exponent tpk{u)- Set 

(3.10) ip kt2 {u) = A 0,(0) ~ , 

m(u ( k ')Y 

if |^(0)| > K/y/n and 

, \ T A 1- , ^fcfc(w (fc) )^(-" (fc) )-?fc(" (fc) )fe(w (fc) ) 

(3.11) Vfc,2(«)= A ^i(0)— r- , .Do " 

otherwise, where k is a positive number. 

Step 3. Finally, we construct an estimate for i>k(x) by applying the Fourier 
inversion formula combined with a regularization to ipk,2( u )'- 

(3.12) %(x) = — — — f e- iu *$ kj2 {u) + a 2 k ]IC(uh n ) du, 

27T Jr 

where tC{u) is a regularizing kernel supported on [—1, 1] and h n is a sequence 
of bandwidths which tends to as n — > oo. The choice of the sequence h n 
will be discussed later on. 



Remark 3.2. The parameter n determines the testing error for the 
hypothesis H:m>0. Indeed, if m = 0, then 4>i(0) = and by the central 
limit theorem 

P($i(0)| > n/y/n) < P(v^l^(0) - > «) 

-> P(|f| > k/\Jv&t[Z 1 ]), n -> oo, 

with £~Af(0,l). 

4. Asymptotic analysis. In this section, we are going to study the asymp- 
totic properties of the estimates z?fc(x), fe = 1, . . . ,d. In particular, we prove 
almost sure uniform as well as pointwise convergence rates for v k (x). More- 
over, we will show the optimality of the above rates over suitable classes of 
time-changed Levy models. 

4.1. Global vs. local smoothness of Levy densities. Let Lj be a one- 
dimensional Levy process with a Levy density v. Denote v{x) = x 2 v(x). 
For any two nonnegative numbers /3 and 7 such that 7 E [0, 2] consider two 
following classes of Levy densities v: 

(4.1) e/3 = {v. J (l + \uf)F[v](u)du<oo\ 
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and 

(4.2) 93 7 = {z,:/ u(y) dy X 5^ e -+ +0 j, 

I ./|j/|>« e7 J 

where II is some positive function on M + satisfying < II(+0) < oo. The 
parameter 7 is usually called the Blumenthal-Geetor index of Lt. This in- 
dex 7 is related to the "degree of activity" of jumps of L t . All Levy mea- 
sures put finite mass on the set (— 00,— e] U [e, 00) for any arbitrary e > 0. 
If i>([— e, e]) < 00 the process has finite activity and 7 = 0. If v([—€, e]) = 00, 
that is, the process has infinite activity and in addition the Levy measure 
i>((—oo, — e] U [e, 00)) diverges near at a rate |e| -7 for some 7 > 0, then the 
Blumenthal-Geetor index of Lt is equal to 7. The higher 7 gets, the more 
frequent the small jumps become. 

Let us now investigate the connection between classes Gp and 23 7 . First, 
consider an example. Let Lt be a tempered stable Levy process with a Levy 
density 

v ( x ) = T(l - 7 7 ) X ~ (7+1) 6Xp (~f ) X> °' 

where 7 £ (0, 1). It is clear that v £ 53 7 but what is about S/3? Since 

^) = r^i - 7 7) xl ~ 7 ex p( - f) 1 (o.«')( a: )' 

we derive 

/•oo 

F[P](u)= / e iu ^(x)dxx 2 7 7 (1 - 7 )e m(1 " 7/2) ^ 2+7 , 

by the Erdelyi lemma [see Erdelyi (1956)]. Hence, v cannot belong to 
as long as f3 > 1 — 7. The message of this example is that given the activity 
index 7, the parameter f3 determining the smoothness of u, cannot be taken 
arbitrary large. The above example can be straightforwardly generalized to 
a class of Levy densities supported on M+. It turns out that if the Levy 
density v is supported on [0, 00), is infinitely smooth in (0, 00) and v £ 25 7 
for some 7 £ (0, 1), then v £ <3^ for all j3 satisfying < (3 < 1 — 7 and v ^ 6^ 
for j3 > 1 — 7. As a matter of fact, in the case 7 = (finite activity case) the 
situation is different and f3 can be arbitrary large. 

The above discussion indicates that in the case v £ 23 7 with some 7 > it 
is reasonable to look at the local smoothness of instead of the global one. 
To this end, fix a point xq £ M and a positive integer number s > 1. For any 
5 > and D > introduce a class S) s (xo, 5, D) of Levy densities v defined as 

f) s (x , S,D) = lv. u(x) £ C s (]x -5,x + S[), 

(4-3) 

sup \v {l) (x)\ <D for l = l,...,s\. 

x&]xq— 5,xq+8[ 
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4.2. Assumptions. In order to prove the convergence of Pfc(cc), we need 
the assumptions listed below: 

(AL1) The Levy densities v±, . . . , are in the class 25 7 for some 7 > 0. 
(AL2) For some p > 2, the Levy densities Vk,k = 1, . . . , d, have finite ab- 
solute moments of the order p: 



I 

Jr 



x\ p Vk{x) dx < 00, fe = 1, ...,d. 



(ATI) The time change T is independent of the Levy process L and the 
sequence T& = T(Afc) — T(A(k — 1)), k € N, is strictly stationary, a-mixing 
with the mixing coefficients (aT(j))jeN satisfying 

olt{o) < aoexp(-aij), j G N, 
for some positive constants ao arid ai. Moreover, assume that 

E[r -2/7( A )] < 00j E[r 2p (A)] < 00 

with 7 and p being from assumptions (AL1) and (AL2), respectively. 
(AT2) The Laplace transform £ t (z) of T(t) fulfills 

£' t (z)=o(l), C'l{z)/C' t {z) = 0{l), \z\^oo, Rez>0. 

(AK) The regularizing kernel K, is uniformly bounded, is supported on 
[—1,1] and satisfies 

lC(u) = 1, u £ [— or:, or-], 

with some < < 1 . 

(AH) The sequence of bandwidths /i n is assumed to satisfy 



C = 0(n'- 5 ), M.^yi- log i- = »(!), n^oo, 

for some positive number <5 fulfilling 2/p < 5 < 1, where 

M n = max sup 1 0~ 1 (u (A:) ) | . 
{|u|<i/M 

Remark 4.1. By requiring G *B 7 , A; = 1, . . . ,d, with some 7 > 0, we 
exclude from our analysis pure compound Poisson processes and some infi- 
nite activity Levy processes with 7 = 0. This is mainly done for the sake of 
brevity: we would like to avoid additional technical calculations related to 
the fact that the distribution of Yt is not in general absolutely continuous 
in this case. 



Remark 4.2. Assumption (ATI) is satisfied if, for example, the pro- 
cess T(t) is of the form (1.1), where the rate process p[u) is strictly station- 
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ary, geometrically a-mixing and fulfills 

/ /-A \-2/7 
(4.4) E[p 2p (u)} < oo, u€[0,A], EN p(u)du\ < oo. 

In the case of the Cox-Ingersoll-Ross process p (see Section 5.2), assump- 
tions (4.4) are satisfied for any p > and any 7 > 0. 

Remark 4.3. Let us comment on assumption (AH). Note that in order 
to determine M n , we do not need the characteristic function (j){u) itself, 
but only a low bound for its tails. Such low bound can be constructed if, 
for example, a low bound for the tail of C' t (z) and an upper bound for the 
Blumenthal-Geetor index 7 are available [see Belomestny (2010b) for further 
discussion]. In practice, of course, one should prefer adaptive methods for 
choosing h n . One such method, based on the so called "quasi-optimality" 
approach, is proposed and used in Section 5.1. The theoretical analysis of 
this method is left for future research. 



4.3. Uniform rates of convergence. Fix some k from the set {1, 2, . . . , d}. 
Define a weighting function w(x) = log _1 ^ 2 (e + \x\) and denote 

Pk -VkWh^w) =su-p[w{\x\)\9 k {x) -9 k (x)\]. 

Let £ n be a sequence of positive r.v. and q n be a sequence of positive real 
numbers. We shall write £ n = O a . s . (q n ) if there is a constant D > such that 
P(limsup n ^ oc q' 1 ^ <D) = 1. In the case P(limsup n ^ oc g" 1 ^ = 0) = 1, we 
shall write £ n = o a . s . (<?«)• 



Theorem 4.4. Suppose that assumptions (AL1), (AL2), (ATI), (AT2), 
(AK) and (AH) are fulfilled. Let Vk{x) be the estimate for v k (x) defined in 
Section 3.2. If £ &/3 for some j3 > 0, then 



for arbitrary small e > 0, where 



\C' A {-^ k {u))\ 

Corollary 4.5. Suppose that a k = 0, 7 G (0, 1] in assumption (AL1) 
and 

\£'&(z)\ > exp(— a|z| ,? ), \z\— >oo, Rez>0, 
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Table 1 

Uniform convergence rates for in the case = 





|£'a(z)I>M- q 


|£'a(*)I> 


exp(-a|z|") 


Mfc > 


/x fc = 


fJ.k > o 


Mfe = 


n -/3/(2c+2/3 + l) 


n -/3/(2a 7 +2/3 + l) 


log~' 3/ " n 


log-^n 


x log( 3 +^/< 2Q - 


v2fi+1 \n) xlog^+^/^+^H 



for some a > and n > 0. If fik> 0, f/ien 

(/ log 3+e n \ 
A/ exp(ac • h^) + ^ j 

i/rai/i some constant c > 0. in t/ie case /i^ = we /iaue 



(/log 3+e n \ 
Y exp(ac • /i- 7r? ) + hP\. 

Choosing h n in such a way that the r.h.s. of (4-5) and (4-6) are minimized, 
we obtain the rates shown in the Table 1. If 7 G (0,1] in assumption (AL1) 
and 

|£a(*)I ~ M~ a » N-^oo. Rez>0, 
/or some a > 0, i/ien 



Ml/^''» 1/2 ~* + ''» 



l^fc _ zy fc||L 00 (R,-u ) ) 



provided > 0. /n i/ie case = 0, one /ias 



n " / 

T/ie c/ioices fr n = n -V(2(*+/»)+i) log (3+ £ )/(2(a+«+i) (n) anrf 

/!„ = n -l/(2(a7+/*)+l) log (3+ £ )/(2(a7+/3)+l) (n) 

/or i/ie cases > and = 0, respectively, lead to the bounds shown in 
Table 1. In the case > 0, the rates of convergence are given in Table 2. 

Table 2 



Uniform convergence rates for vu 


in t/ie case 


a fc >0 


Kk(*)i£i*r a 


|£k(*)|> 


exp(-a|zD 


n -/3/(4a+2/3+l) log (3+ £ )/3/(4c+2/3+l) ^ 


log" 


'"/ 2 "n 
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Remark 4.6. As one can see, assumption (AH) is always fulfilled for 
the optimal choices of h n given in Corollary 4.5, provided 07 + /3 > and 
p>2 + l/(a 7 + /3). 

4.4. Pointwise rates of convergence. Since the transformed Levy den- 
sity i>k is usually not smooth at (see Section 4.1), pointwise rates of con- 
vergence might be more informative than the uniform ones if G Q3 7 for 
some 7 > 0. It is remarkable that the same estimate as before will achieve 
the optimal pointwise convergence rates in the class Sj s (xq, 8, D), provided 
the kernel K satisfies (AK) and is sufficiently smooth. 

Theorem 4.7. Suppose that assumptions (AL1), (AL2), (ATI), (AT2), 
(AK) and (AH) are fulfilled. If G Sj s (xq, 8, D) with $) s (xo,8,D) being de- 
fined in (4-3), for some s> 1, 5 > 0, D > 0, and fC G C m (R) for some m> s, 
then 



with SHfc(it) as in Theorem 4-4- As a result, the pointwise rates of convergence 
for different asymptotic behaviors of the Laplace transform Ct coincide with 
ones given in Tables 1 and 2, if we replace (5 with s . 

Remark 4.8. If the kernel fC is infinitely smooth, then it will automat- 
ically "adapt" to the pointwise smoothness of i>k, that is, (4.7) will hold for 
arbitrary large s > 1, provided Vk G f) s (xo, 8, D) with some 8 > and D > 0. 
An example of infinitely smooth kernels satisfying (AK) is given by the so 
called flat-top kernels (see Section 5.1 for the definition). 

4.5. Lower bounds. In this section, we derive a lower bound on the min- 
imax risk of an estimate v(x) over a class of one-dimensional time-changed 
Levy processes Y± = L-j-(t) with the known distribution of T, such that the 
Levy measure v of the Levy process Lt belongs to the class &p n QS 7 with 
some P > and 7 G (0, 1] . The following theorem holds. 

Theorem 4.9. Let Lt be a Levy process with zero diffusion part, a drift [i 
and a Levy density v. Consider a time-changed Levy process Y t = Lj-^, 
where the Laplace transform of the time change Tit) fulfills 



(4.9) liminfinf sup F ( „ iT) (\\v - u\\ Loo (R, w ) > e^log 1 (l//i„))>0 



(4.7) 




(4.8) 

for k = 0, 1 




Rez>0, 
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for any e > and any sequence h n satisfying 

nA-^C'^c ■ h-t)] 2 hf +1 = O(l), n -> oo, 

in the case /i = and 

nA-^Cc • = O(l), n ^ oo, 

in i/ie case /x > 0, with some positive constant c > 0. Note that the infimum 
in (4-9) is taken over all estimators of v based on n observations of the 
r.v. Ya and P(i/,t) stands for the distribution of n copies o/ 1a. 

Corollary 4.10. Suppose that the underlying Levy process is driftless, 
that is, /i = and Ct(z) = exp(-azt) for some a > 0, corresponding to a de- 
terministic time change process T(t) = at. Then by taking 

logn — ((2/3 + l)/7) loglogn^ _1 ^ 7 



h r , 



2acA 



we arrive at 

liminfinf sup F {ViT )(\\v - u\\ Loo (R,w) > £ • A^log-^n) > 0. 

Corollary 4.11. Again let /i = 0. Take C t {z) = 1/(1 + z) aot , Rez > 
/or some ao > 0, resulting in a Gamma process T(t) (see Section 5.1 for the 
definition). Under the choice 

we get 

liminfinf sup F {v>T} (\\v-v\\ Loo w{R) > e- ( nA )-^ 2 ^+^ +1 ) log" 1 n) > 0, 

where a = QoA + 1. 

Remark 4.12. Theorem 4.9 continues to hold for A — > and there- 
fore can be used to derive minimax lower bounds for the risk of u in high- 
frequency setup. As can be seen from Corollaries 4.10 and 4.11, the rates 
will strongly depend on the specification of the time change process T ■ 

The pointwise rates of convergence obtained in (4.7) turn out to be op- 
timal over the class S) s (xo, 5, D) n 25 7 with s > 1, <5 > 0, xoEM, D > and 
7 £ (0, 1] as the next theorem shows. 

Theorem 4.13. Let Lt be a Levy process with zero diffusion part, a drift fi 
and a Levy density v. Consider a time-changed Levy process Yj = -^T(t)> 
where the Laplace transform of the time change T{t) fulfills (4-8). Then 

(4.10) liminfinf sup ¥ {uT) (\u(x ) -u(x )\ > e/i* log _1 (l//i n )) > 
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for s > 1, 5 > 0, D > 0, any e > and any sequence h n satisfying 




in the case [i > 0, iirei/t some positive constant c> 0. 
4.6. Extensions. 

One- dimensional time-changed Levy models. Let us consider a class of 
one-dimensional time-changed Levy models (2.7) with the known time change 
process, that is, the known function Ct for all t > 0. This class of models 
trivially includes Levy processes without time change [by setting Ct(z) = 
exp(— tz)] studied in Neumann and Reifi (2009) and Comte and Genon- 
Catalot (2010). We have in this case 



where C A is an inverse function for £a- Thus, ip'{(u) is again a ratio- type 
estimate involving the derivatives of the c.f. <j> up to second order, that agrees 
with the one proposed in Comte and Genon-Catalot (2010) for the case of 
pure Levy processes. Although we do not study the case of one-dimensional 
models in this work, our analysis can be easily adapted to this situation as 
well. In particular, the derivation of the pointwise convergence rates can be 
directly carried over to this situation. 

The case of the unknown (a/-). One way to proceed in the case of the 
unknown (o~k) and Vk £ ^7 with 7 < 2 is to define Vk(x) = x 4 Vk(x). Assuming 
J Vk{x) dx < 00, we get 



JR 

Hence, in the above situation one can apply the regularized Fourier inversion 
formula to an estimate of t/j^ (u) instead of tp^(u). 

Estimation of Let us first estimate ipf.. Set 



(4.11) Vi(n) 



0»£' A (-^M) - tP'iu)^-^ (u))/£' A (-Mu)) 

[C A (-Mu))] 2 



with 



V>i(u) = -£ A (4>(u)), 
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Under Assumptions (AL2), (ATI), (AT2), (AK) and (AH) we derive 

(4.12) \\i>k-i>k\\L 00 (R l ui)=O a ,. B . 



11 



with a weighting function 



w(u) 



■ dv 



/ \C' A (-Mv))\ 

Now let us define an estimate for £a as a solution of the following optimiza- 
tion problem 

(4.13) £ A = arg inf sup{w(u)\C(-Mu)) - 0(u^)\}, 

where 9JIa is the set of completely monotone functions C satisfying C(0) = 1 
and £'(0) = —A. Simple calculations and the bound (4.12) yield 



' log 3+e n 



(4.14) supMu)|£a(-^*(u)) " £a(-^*(«))I} = Oa, 
Since any function £ from 9JIa has a representation 



I? 



/■oo 

£(u) = / e - ux dF{x) 
Jo 



with some distribution function F satisfying f xdF(x) = A, we can replace 
the optimization over 9JT in (4.13) by the optimization over the correspond- 
ing set of distribution functions. The advantage of the latter approach is that 
herewith we can directly get an estimate for the distribution function of the 
r.v. T(A). A practical implementation of the estimate (4.13) is still to be 
worked out, as the optimization over the set 9JTa is not feasible and should 
be replaced by the optimization over suitable approximation classes (sieves). 
Moreover, the "optimal" weights in (4.13) depend on the unknown C How- 
ever, it turns out that it is possible to use any weighting function which is 
dominated by w(u), that is, one needs only some lower bounds for £ A . 

Remark 4.14. It is interesting to compare (4.12) and (4.14) with Theo- 
rem 3.2 in Horowitz and Mammen (2007). At first sight it may seem strange 
that, while the rates of convergence for our "link" function Ca and the 
"components" ipk depend on the tail behavior of C' A , the rates in Horowitz 
and Mammen (2007) rely only on the smoothness of the link function and 
the components. The main reason for this is that the derivative of the link 
function in the above paper is assumed to be uniformly bounded from below 
[assumption (A8)], a restriction that can be hardly justifiable in our setting. 
The convergence analysis in the unbounded case is, in our opinion, an im- 
portant contribution of this paper to the problem of estimating composite 
functions that can be carried over to other setups and settings. 
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4.7. Discussion. As can be seen, the estimate can exhibit various 
asymptotic behavior depending on the underlying Levy process Lt and the 
time-change T(t). In particular, if the Laplace transform Ct(z) of T dies off 
at exponential rate as Kez — > +00 and fik = 0, then the rates of convergence 
of Dk are logarithmic and depend on the Blumenthal-Geetor index of the 
Levy process Lt. The larger is the Blumenthal-Geetor index, the slower 
are the rates and the more difficult the estimation problem becomes. For 
the polynomially decaying Ct{z) one gets polynomial convergence rates that 
also depend on the Blumenthal-Geetor index of Lt. Let us also note that 
the uniform rates of convergence are usually rather slow, since (3 < 1 — 7 in 
most situations. The pointwise convergence rates for points xq 7^ can, on 
the contrary, be very fast. The rates obtained turn out to be optimal up 
to a logarithmic factor in the minimax sense over the classes &p n 5S 7 and 
F) s (xo,5,D)n<B 1 . 

5. Simulation study. In our simulation study, we consider two models 
based on time-changed normal inverse Gaussian (NIG) Levy processes. The 
NIG Levy processes is a relatively new class of processes introduced in 
Barndorff-Nielsen (1998) as a model for log returns of stock prices. The 
processes of this type are characterized by the property that their incre- 
ments have NIG distribution. Barndorff-Nielsen (1998) considered classes of 
normal variance-mean mixtures and defined the NIG distribution as the case 
when the mixing distribution is inverse Gaussian. Shortly after its introduc- 
tion, it was shown that the NIG distribution fits very well the log returns on 
German stock market data, making the NIG Levy processes of great interest 
for practioneers. A NIG distribution has in general four parameters: a G R+, 
^6 8, 6 € M+ and /i£t with |x| < a. Each parameter in NIG(a, x, S,fi) 
distribution can be interpreted as having a different effect on the shape of 
the distribution: a is responsible for the tail heaviness of steepness, x has 
to do with symmetry, 5 scales the distribution and \i determines its mean 
value. The NIG distribution is infinitely divisible with c.f. 



(j){u) = exp{<5(v a 2 — x 2 — \J a 2 — (x + iu) 2 + ifiu)}. 

Therefore, one can define the NIG Levy process {Lt)t>o which starts at zero 
and has independent and stationary increments such that each increment 
L t +A — L t has NIG(a, x, Ad, A/x) distribution. The NIG process has no dif- 
fusion component making it a pure jump process with the Levy density 

,„ . . 2a5 exp(xx)-FTi(a|d) 
(5.1) v[x) = n , 

7T \X\ 

where K\(z) is the modified Bessel function of the third kind. Taking into 
account the asymptotic relations 

Ki(z)x 2/z, and K\(z) >c J ^~e~ z , z^+00, 
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we conclude that v G *Bi and v G Sj s (xo,S, D) for arbitrary large s > and 
some 6 > 0, D > 0, if xo 7^ 0. Moreover, assumption (AL2) is fulfilled for any 
p > 0. Furthermore, the identity 

-^log<Xu) = -a 2 /(a 2 - (x + m) 2 ) 3/2 
air 

implies 1^ S ©2-5 for arbitrary small 5 > 0. In the next sections are going 
to study two time-changed NIG processes: one uses the Gamma process as 
a time change and another employs the integrated CIR processes to model T ' ■ 

5.1. Time change via a Gamma process. Gamma process is a Levy pro- 
cess such that its increments have Gamma distribution, so that T is a pure- 
jump increasing Levy process with the Levy density 

vt{x) = Ox^ 1 exp(— Ax), x > 0, 

where the parameter 8 controls the rate of jump arrivals and the scaling 
parameter A inversely controls the jump size. The Laplace transform of T 
is of the form 

C t (z) = (1 + z I \y et , Rez>0. 

It follows from the properties of the Gamma and the corresponding inverse 
Gamma distributions that assumptions (ATI) and (AT2) are fulfilled for 
the Gamma process T, provided OA > 2/7. Consider now the time-changed 
Levy process Yt = ^Tit) where Lt = {L\,Ll,Ll) is a three-dimensional Levy 
process with independent NIG components and T is a Gamma process. Note 
that the process Yj is a multidimensional Levy process since T was itself the 
Levy process. Let us be more specific and take the A-increments of the Levy 
processes Lj, L 2 t and L\ to have NIG(1, -0.05, 1, -0.5), NIG(3, -0.05, 1,-1) 
and NIG(1, —0.03, 1,2) distributions, respectively. Take also 9 = 1 and A = 1 
for the parameters of the Gamma process T ■ Next, fix an equidistant grid 
on [0,10] of the length n = 1,000 and simulate a discretized trajectory of 
the process Y t . Let us stress that the dependence structure between the 
components of Yt is rather flexible (although they are uncorrelated) and 
can be efficiently controlled by the parameters of the corresponding Gamma 
process T. Next, we construct an estimate v\ as described in Section 3.2. 
We first estimate the derivatives (j>i, (p2, <Pn an d <j)±2 by means of (3.7) 
and (3.8). Then we estimate ip'{(u) using the formula (3.10) with k = 1 and 
/ = 2. Finally, we get v\ from (3.12) where the kernel tC is chosen to be the 
so-called flat-top kernel of the form 

fl, |x|<0.05, 
-l/(|a,|-0.05)- 

K{x) 



ex P ; rn — I, 0.05<x<l, 

lCJ, |x|>l. 
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The flat-top kernels obviously satisfy assumption (AK). Thus, all assump- 
tions of Theorem 4.4 are fulfilled and Corollary 4.5 leads to the following 
convergence rates for the estimate v\ of the function P\(x) = x 2 v(x): 

ll-i-- 1 |U oo ( K ^) = O a . s .(^ (1 " <5 ' )/(0A+5/2) log^'V^ 5 / 2 )^)), n^oc, 

with arbitrary small positive numbers 5' and e', provided the sequence h n 
is chosen as in Corollary 4.5. Let us turn to the finite sample performance 
of the estimate v\. It turns out that the choice of the sequence h n is cru- 
cial for a good performance of v\. For this choice, we adopt the so called 
"quasi-optimality" approach proposed in Bauer and Reifi (2008). This ap- 
proach is aimed to perform a model selection in inverse problems without 
taking into account the noise level. Although one can prove the optimality 
of this criterion on average only, it leads in many situations to quite reason- 
able results. In order to implement the "quasi-optimality" algorithm in our 
situation, we first fix a sequence of bandwidths hi, . . . , Hl and construct the 
estimates , ■ ■ ■ , using the formula (3.12) with bandwidths hi,... 
respectively. Then one finds I* = argmin^ f(l) with 

/(0 = II^ +1) -^ ) IIl i( r), i = i,. 

Denote by v\ = v\ a new adaptive estimate for v\. In our implementation 
of the "quasi-optimality" approach, we take h\ = 0.5 + 0.1 x I, I = 1, . . . , 40. 
In Figure 1, the sequence /(/), I = 1,...,40, is plotted. On the right-hand 
side of Figure 1, we show the resulting estimate v\ together with the true 
function v\. Based on the estimate V\, one can estimate some functionals 
of v\. For example, we have J Vi(x) dx = 1.049053 [J &i(x) dx = 1.015189]. 




Fig. 1. Left-hand side: objective function f(l) for "quasi-optimality" approach versus the 
corresponding bandwidths hi, 2 = 1,. ..,40. Right-hand side: adaptive estimate v\ (dashed 
line) together with the true function v\ (solid line). 
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5.2. Time change via an integrated CIR process. Another possibility to 
construct a time-changed Levy process from the NIG Levy process Lt is to 
use a time change of the form (2.2) with some rate process p(t). A possible 
candidate for the rate of the time change is given by the Cox-Ingersoll- 
Ross process (CIR process). The CIR process is defined as a solution of the 
following SDE: 

dZ t = n{ri-Z t )dt + Cy%dWt, Z = l, 

where Wt is a Wiener process. This process is mean reverting with k > 
being the speed of mean reversion, i] > being the long-run mean rate and 
C > controlling the volatility of Z%. Additionally, if 2nr] > ( 2 and Zq has 
Gamma distribution, then Z% is stationary and exponentially a-mixing [see, 
e.g., Masuda (2007)]. The time change T is then defined as 

T(t)= [ Z t dt. 
Jo 

Simple calculations show that the Laplace transform of T{t) is given by 

= exp(ftV/C 2 ) exp(-2z/( K + 7 (z) coth( 7 (z)t/2))) 
t[Z) (cosh(7(z)i/2) + Ksinh( 7 (z)t/2)/7(z)) 2K VC 2 

with j(z) = \/ k 2 + 2( 2 z. It is easy to see that Ct(z) x exp(— -^P[l + tKij]) as 
\z\ — > oo with Rez > 0. Moreover, it can be shown that E|7~(t)| p < oo for any 
p£K. Let Lt be again a three-dimensional NIG Levy process with indepen- 
dent components distributed as in Section 5.1. Construct the time-changed 
process Y t = Lj-m . Note that the process Y t is not any longer a Levy pro- 
cess and has in general dependent increments. Let us estimate Pi, the trans- 
formed Levy density of the first component of Lt. First, note that according 
to Theorem 4.4, the estimate v\ constructed as described in Section 3.2, has 
the following logarithmic convergence rates 

ll^i - ^llUooCR,™) =Oa.s.( lo g~ 2(2 ~ 5) ( n ))i n-^oo, 

for arbitrary small 5 > 0, provided the bandwidth sequence is chosen in the 
optimal way. Finite sample performance of u\ with the choice of h n based on 
the "quasi-optimality" approach is illustrated in Figure 2 where the sequence 
of estimates , . . . , was constructed from the time series Y"a ; • • • > ^nA 
with n = 5,000 and A = 0.1. The parameters of the used CIR process are 
k = 1, rj = 1 and £ = 0.1. Again we can compute some functionals of v\. We 
have, for example, following estimates for the integral and for the mean of v\\ 
\v\{x) dx = 1.081376 [$v 1 (x)dx = 1.015189] and f xi>i(x) dx = -0.4772505 
[[ xvi(x) dx = -0.3057733]. 

Let us now test the performance of estimation algorithm in the case of 
a time-changed NIG process (parameters are the same as before), where 
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Fig. 2. Left-hand side: objective function f(l) for the "quasi-optimality" approach ver- 
sus the corresponding bandwidths hi. Right-hand side: adaptive estimate T>\ (dashed line) 
together with the true function v\ (solid line). 



the time change is again given by the integrated CIR process with the pa- 
rameters rj = 1, £ = 0.1 and k G {0.05,0.1,0.5, 1}. Figure 3 (left) shows the 
boxplots of the resulting error \\i>\ —^i\\l x (r,w) computed using 100 trajec- 
tories each of the length n = 5,000, where the time span between observa- 
tion is A = 0.1. Note that if our time units are days, then we get about 
two years of observations with about one mean reversion per month in the 
case k = 0.05. As one can see, the performance of the algorithm remains 
reasonable for the whole range of k. In Figure 3(right), we present the box- 
plots of the error \\ui — i'i\\L ao (R,w) m the case of r\ = 1, £ = 0.1, k = 1 and 
n £ {500,1,000,3,000,5,000}. As one can expect, the performance of the 
algorithm becomes worse as n decreases. However, the quality of the esti- 
mation remains reasonable even for n = 500. 



6. Proofs of the main results. 



6.1. Proof of Theorem 4-4- For simplicity, let consider the case of [i\ > 
and <Tfc = 0. By Proposition 7.4 [take G n (u, z) = exp(iuz), L n = fx n = a n = 1, 
a = 0,6 = 1] 

n\M0)\ < n/y/n) > P(M0) - <M0)| > in) < Bn- 1 - 5 

for some constants 5 > 0, B > and n large enough. Furthermore, simple 
calculations lead to the following representation: 

Vi>) - ^, 2 (u) = ^(<M0) - <M0)) 

(6.1) ' 

+ n {u) + ni{u) + n 2 {u), 
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Fig. 3. Boxplots of the error \\ui — vi\\L ao (R,w) f or different values of the mean reversion 
speed parameter ft and different numbers of observations n. 
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K (u) = [Vi(u)i,l(u) - V 2 (u)4>' k ( U )](Mu {k) ) ~ Uu {k) )) 

+ ^ 2 («)(0fc(« (fc) )-0fc(« (fc) )) 
- Vl {u)^ kk {u^)-^ kk {u^)) 
+ V l (u)^(u)( ( f> lk (uM) -j lk (u^)), 
U^u) = [V±{u)<{u) - %{uW k {u)]{Uu {k) ) - Uu {k) )) 

+ v 2 (u)(4> k (u {k) )-Mu^)) 

+ ^(«)^(«)(^(u (fc) )-^j fe (« (fc) )) > 

)= { ) M^n 2 



+ 



Vi{u) 



(<MQ)-<MQ)) 

0,(0) 



0,(0) 

1 



A&(« (fc) )~ ^ A (-^("))' 



^ = A[*(«M)]> = " yi( * W W 

Vi(u) = (i» - i)Vi(«), y 2 (n) = (r 2 («) - i)y 2 («) 



r(u) 



i 



^ («(*)) 



(0i(« (fc) )-0i(« (fc) )) 



The representation (6.1) and the Fourier inversion formula imply the follow- 
ing representation for the deviation v k — v k : 



v k {x) -v k (x) 



1 (0,(0) -0,(0)) 



2tt 

1 
1 



MO) 



c ip k (u)lC(uh n ) du 



c 7lo(u)lC(uh n ) du 
c lZi(u)lC(uh n ) du 



STATISTICAL INFERENCE FOR TIME-CHANGED LEVY PROCESSES 27 



c lZ2(u)lC(uh n ) du 

c (l — K.(uh n ))tp'l(u) du. 



First, let us show that 



sup 

x6M 



c TZi(u)IC(uh n ) du 



'log 3+£ ra f 1 ' 11 



n 



l/hr, 



SH?(it) 



and 



sup 



/ 



1 log 3+£ n 



On 



9\ 2 k (u)du 



'"- x TZ2(u)IC(uh n ) du 
We have, for example, for the first term in TZ±(u) 

^ e -i«(r(«) - l^u^'HtoCu^) - «« (f:) ))^K) dn 

< sup |r(«)-l|sup[«;(H)|0,(«( fc ))-0 l (« (fc) )l]«'" 1 (l/M 
|u|<i//i„ weR 

1^(^)11^(^)1 

with u;(u) = log _1//2 (e + it), it > 0. Fix some £ > and consider the event 
A = 



sup 

{|«|<1/A„} 



[w(\u\)\Uu^)-Mu {k) )\}<^ 



logn 



n 



By assumption (AH), it holds on A that 



sup 

\u\<l/h 7 



< iM nW - l {l/h n )^\ogn/n 
= o(\fhn), n— >oo, 



and hence 
(6.2) 



sup |1 — r(u)| = o(y h n ), n— >oo. 

{\u\<l/h n } 



Therefore, one has on A that 

•l/hn 



sup 



l/h„ 



\T{u) - l)Vl(«)<(«)(<Mu W ) " Mu (k) ))JC(uh n )du 



h n log n 



n 



l/hn 
-l/hn 



SHfc(w) du) =o 



n 



l/hr, 



9%{u) du 
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since ip'l{u) and JC(u) are uniformly bounded on M. On the other hand, 
Proposition 7.4 implies [on can take G n (u, z) = exp(inz), L n = p, n = a n = l, 
o = 0, 6 = 1] 



-1-5' 



p (-4-) ^ n ~ ' " > oo, 
for some 5' > 0. The Borel-Cantelli lemma yields 



sup 

x6M 



1 log 3+e n 



n 



l/h n 

l/hn 



y\ 2 k {u)du 



Other terms in 1Z\ and IZ2 can be analyzed in a similar way. Turn now to 
the rate determining term IZq. Consider, for instance, the integral 



(6.3) 



with 



l/hn 
l/hn 



Aux Vi (u)^{u) (<f>i («(*) ) - fa {u^))K,{uh n ) du 



nh, 



Z)K n 



x-Z) 



1 



E<! Z l —K„ 
K 



x-Z k 



S{x) 



K n (z) 



'Vi(u/ h n )tp'l(u/ h n )fC(u) du. 



-1 



Now we are going to make use of Proposition 7.4 to estimate the term S(x) 
on the r.h.s. of (6.3). To this end, let 



G n (u,z) 



1 



u — z 



Since Vki v l G ^7 f° r some 7 > [assumption (AL1)], the Levy processes L\ 
and L\ possess infinitely smooth densities pk,t and pij which are bounded 
for t > [see Sato (1999), Section 28] and fulfill [see Picard (1997)] 



(6.4) 
(6.5) 



sup{p fc> t(x)} <t~ l/ \ t->0, 



sup{p M 0r)}<^, 



0. 



Moreover, under assumption (AL2) [see Luschgy and Pages (2008)] 



(6.6) \x\ m p kit (x)dx = 0(t), 



\x\ m pi tt (x)dx = 0(t), t->0, 
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and 



(6.7) 



x\ m p ktt (x)dx = 0(t m ), 
\x\ m p l>t (x)dx = 0(t m ), i^+oo, 



for any 2 < m < p. As a result, the distribution of (Z k ,Z l ) is absolutely 
continuous with uniformly bounded density q^i given by 



Qki(y,z)= Pk,t{y)Pi,t{z)dit{dt), 
Jo 

where ir is the distribution function of the r.v. T(A). The asymptotic rela- 
tions (6.4)-(6.7) and assumption (ATI) imply 



E[\Z l \ 2 \G n (u,Z 



k\\2i 



< 



1 

K 

Co 



A", 



u-y 



\z\ 2 qki{y,z) dz \dy 



K 



K n (v)\ 2 dv 

l/h„ 



l/h r , 



\V!(u)\ z du 



with some finite constants Cq > and C\ > 0. Similarly, 

-l/hn 



E[\Z k \ 2 \G n (u,Z k )\ 2 ]<C 2 
E[\Z k \ 4 \G n (u,Z k )\ 2 ]<C 3 



E[\Z k \ 2 \Z l \ 2 \G n (u,Z k )\ 2 } <C 4 



■l/K 

l/hn 

■l/K 

l/hn 
l/K 



\Vi{u)\ z du, 



\Vi{u)\ 2 du, 



|Fi(u)| 2 du 



with some positive constants 62,63 and C4. Define 

"l/hn 



K = C 



l/hn 



\V\(u)\ l du, 



l/K 



Un=\\1C\\ooU"\\oo / \Vy{u)\du 



00 || lloo 



|u||Vi(u)| dit, 
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where C = maxfc = i 5 2,3,4{Cfc}. Since | Vi — > oo as \u\ — > oo and h n — > oo, 
we get fi n /&n = 0(1)- Furthermore, due to assumption (AH) 

(6-8) fm^K^an^n 1 ' 2 - 5 / 2 ^ L n <h^ 2 o n <r?l 2 o n , n^oc, 

and a n = 0(h~ l,2 M n ) = 0{n 1 / 2 ). Thus, assumptions (AG1) and (AG2) of 
Proposition 7.4 are fulfilled. Assumption (AZ1) follows from Lemma 7.1 and 
assumption (ATI). Therefore, we get by Proposition 7.4 



'^log 3+£ 



P supK|z|)|S(z)|]>a 
\zm V n 

for some 5' > and £ > £o- Noting that 

-l/h n 



n 



< 



-1-5' 



°l<c [ n m 2 k (u)du 

J-l/hn 



we derive 



S np[w(\z\)\S(z)\]=O a . s .[J^ / 9\ 2 {u)du 



Other terms in TZq can be studied in a similar manner. Finally, 



/ /log 3+e ra f 1/hn 
\h ~ fk\\L x (R,w) = °a.s. \ / V%(u) du 



n 



l/h n 



(6.9) 

+ I \l-K{uh n )Wl{u)\du. 
^ Jr 

The second, bias term on the r.h.s. of (6.9) can be easily bounded if we recall 
that v k G &p and K,{u) = 1 on [—clk,cik} 

±- [ \l-K{uh n )Wl{u)\du<hP n [ \uf\F[P k ](u)\du 

Z7T JM J{\u\>a K /h n } 

<h? [ (l + \uf)\F[v k ](u)\du, n^oo. 



6.2. Proof of Theorem 4-7. We have 

1 



%(xq) - ffc(x ) 



2tt 



iux .iM 



tfj k (u)lC(uh n ) du - v k {x Q ) 



J1 + J2 



STATISTICAL INFERENCE FOR TIME-CHANGED LEVY PROCESSES 31 
Introduce 



K(z) 



1 

2^ 



e luz K{u) du, 



i 



then by the Fourier inversion formula 



(6.10) 



K{u) 



e~ luz K(z) dz. 



Assumption (AK) together with the smoothness of tC implies that K(z) has 
finite absolute moments up to order m> s and it holds that 



(6.11) 
Hence 

and 



j K(z)dz = l, Jz k K(z)dz = 0, k = l,...,m. 



Ji 



v k (x Q + h n v)K(v) dv - v k (xo) 



\v\>S/hn 



Wk(xo) - Vk{xo + h n v)]K(v) dv 



[vk{xo) - v k (x + h n v)]K(v) dv 



+ 

' '\v\<S/h n 
= h+h- 

Since ||^||oo < Cp for some constant Cp > 0, we get 



h < 2C P 



\K(v)\dv<C p C K (h n /Sy 



<\v\>6/h n 

with Ck = /jj |-f^(' u )||' u \ m dv. Further, by the Taylor expansion formula, 



3=0 



'\v\<S/h„ 
hi + ^22- 



K(v) 



K(v)v J dv 

\v\<S/hn 



■I'D 



(--1)1 



dv 



First, let us bound I21 from above. Note that, due to (6.11), 



'21 



- 1 



j=0 



\v\>8/h„ 



K{v)v 3 dv 
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Hence, 

V 5 J r- J\v\>s/h n 

<(thL\ m LC K exp(6). 

Furthermore, we have for I22 

T h s f 

h2<^\ \K(v)\\v\ s dv. 

s! J\v\<S/hn 

Combining all previous inequalities and taking into account the fact that 
m> s, we derive 



The stochastic term J2 can handled along the same lines as in the proof of 
Theorem 4.4. 

6.3. Proof of Theorem 4-9. Define 

2 



n /sin(a^) 



a k x 

k=l 

with afc = 2" k ,k G N. Since Kq(x) is continuous at and does not vanish 
there, the function 

1 sin(2x) K (x) 
[X) ~2ir ttx K (0) 

is well defined on R. Next, fix two positive numbers ft and 7 such that 
7 6 (0, 1) and < ft < 1 — 7. Consider a function 

$(V) = - 

V ' (l + n 2 )( 1 +«/2l og 2( e + u 2) 

for some xq > and define 



Hh{x) = I fi(x + zh)K(z) dz 
J —00 

for any /i > 0, where 

fi(x) = ^- r e-' lxu 3>{u)du. 
In the next lemma, some properties of the functions fi and //^ are collected. 
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Lemma 6.1. Functions fx and \ih have the following properties: 

(i) fi and fj,h are uniformly bounded on R, 

(ii) for any natural n > 

(6.12) max{n(x),fi h (x)} < \x\~ n , \x\->oo, 

that is, both functions fi(x) and jJLh{%) decay faster than any negative power 
of x, 

(iii) it holds 

(6.13) x 2 0f i(x ) - xfa h (x ) > Dh? log-^l/Zi) 
for some constant D > and h small enough. 



Fix some e > and consider two functions 

1-g 
(1+x 



vi (x) = z/ 7 (x) + + e/i(x) 



where zz-^x) is given by 



u^{x) 



(1 + x 2 



^l{x>0} + -^l{x<0} 



Due to statements (i) and (ii) of Lemma 6.1, one can always choose e in such 
a way that v\ and v 2 stay positive on R + and thus they can be viewed as the 
Levy densities of some Levy processes L\ jt and L 2 j, respectively. It directly 
follows from the definition of v\ and v 2 that V\ , v 2 G 5S 7 . The next lemma 
describes some other properties of v\{x) and v 2 {x). Denote v\(x) = x 2 v\(x) 
and v 2 (x) = x 2 v 2 {x). 

Lemma 6.2. Functions v\{x) and v 2 {x) satisfy 
(6.14) sup |Pi (x) -v 2 {x)\ > eDh 13 log" 1 (1/h) 

and 

/oo 
(l + \uf)\F[j?i}(u)\du<oo, z = l,2, 
-oo 

that is, both functions v\{x) and v 2 (x) belong to the class &»■ 

Let us now perform a time change in the processes L\j and L 2 ,t- To this 
end, introduce a time change Tit), such that the Laplace transform of T(t) 
has following representation: 



POO 

C t (z)=E[e- zT ^]= / e- z VdF t (y), 
Jo 
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where (i*t,i > 0) is a family of distribution functions on M + satisfying 

l-F t (y) <1-F s (y), y E R + , 

for any t < s. Denote by pi t and p2,t the marginal densities of the resulting 
time-changed Levy processes Yij = Lij~(t) an< ^ ^2,t = ^2,T(t)> respectively. 
The following lemma provides us with an upper bound for the x 2 -divergence 
between pn and piti where for any two probability measures P and Q the 
X 2 -divergence between P and Q is defined as 

-1^ dQ, if P<Q, 



x'(P,Q) 




otherwise. 



Lemma 6.3. Suppose that the Laplace transform of the time change T(t) 
fulfills 

(6.16) \CI +I \z)/Cl\z)\ = 0(1), |*| -> oo, 

/or A; = 0, 1,2, and uniformly in A £ [0, 1]. TTien 

X 2 (pi,a,? 2 ,a) < A^^)] 2 /^ 1 ), h ->• 0, 
lozf/i some constant c> 0. 

The proofs of Lemmas 6.1, 6.2 and 6.3 can be found in the preprint ver- 
sion of our paper Belomestny (2010a). Combining Lemma 6.3 with inequal- 
ity (6.14) and using the well-known Assouad lemma [see, e.g., Theorem 2.6 
in Tsybakov (2004)], one obtains 

liminfinf sup Pfsup \P(x) — v{x)\ > log _1 (l//i n ) ) > 

n->oc u yg^ne^ Wir ' 

for any sequence h n satisfying 

nA- 1 [C' t (c ■ h^)] 2 hW+V = O(l), n ^ oo. 

7. Auxiliary results. 

7.1. Some results on time-changed Levy processes. 

Lemma 7.1. Let Lt be a d-dimensional Levy process with the Levy mea- 
sure v and let T(t) be a time change independent of Lt. Fix some A > and 
consider two sequences = T(Ak) — T(A(k — 1)) and = Y^fc — ^A(fc-i)> 
k = 1, . . . ,n, where Y t = Lf(t). If the sequence (Tk)keN is strictly station- 
ary and a-mixing with the mixing coefficients (ctT(j))jeN, then the sequence 
(Zfc)fc 6 N is also strictly stationary and a-mixing with the mixing coefficients 
(az(j))je®, satisfying 

(7.1) a z (j) < a T (j), j G N. 
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Proof. Fix some natural k, I with k + l <n. Using the independence of 
increments of the Levy process Lt and the fact that T is a nondecreasing 
process, we get E[^(Zi, . . . , Z k )] = E[</>(Ti, . . . , T k )] and 

E[cj)(Z 1 , . . . , Z k )ip(Z k+ i, Z n )\ 

= E[0(Ti, . . . , T k )$(T k+ i, ...,T n )], k,leN, 

for any two functions <j> : M fc — > [0, 1] and tp : M. n ~ l ~ k — > [0, 1] , where <f>(ti , . . . , 
t k ) = E[cj)(L tl , . . . ,L tk )\ and ^(h, . . . , t k ) = E[i/j(L tl , . . . , L tk )]. This implies 
that the sequence Z k is strictly stationary and a-mixing with the mixing 
coefficients satisfying (7.1). □ 

7.2. Exponential inequalities for dependent sequences. The following the- 
orem can be found in Merlevede, Peligrad and Rio (2009). 

Theorem 7.2. Let (Z k ,k > 1) be a strongly mixing sequence of centered 
real-valued random variables on the probability space (f2, J 7 , P) with the mix- 
ing coefficients satisfying 

(7.2) a(n) < aexp(-cn), n > 1, a > 0,c > 0. 

Assume that sup k> i\Z k \ < M a.s., then there is a positive constant C de- 
pending on c and a such that 



^2Zi>( \ <exp 



K i=l ) 

for all Q > and n>A, where 



v 2 



nv 2 + M 2 + MC log 2 (n) J 
supf E[Z t } 2 + 2^ Cov(Z u Zj)) . 



Corollary 7.3. Denote 

^EfZjlog^lZ/)], j = l,2,..., 
with arbitrary small e > and suppose that all pj are finite. Then 

y Cov(Zj, Zj) < C max p~ 

trj 3 

for some constant C > 0, provided (7.2) holds. Consequently, the following 
inequality holds: 

v 2 < supE[Zj] 2 + C max pj. 

i 3 

The proof can be found in Belomestny (2010a). 
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7.3. Bounds on large deviations probabilities for weighted sup norms. Let 
Zj = (Xj,Yj), j = 1, . . . , n, be a sequence of two-dimensional random vectors 
and let G n (u,z), n= 1,2,..., be a sequence of complex-valued functions 
defined on R 2 . Define 

1 n 

fhi(u) = -^2XjG n (u,Xj), 
1 - 

m 2 (u) = - ^2YjG n (u,Xj), 
1 n 

rn 3 (u) = - y~]x?G n (u,Xj), 

71 . 
1 n 

m 4 (it) = - ^ XjYjG n (u, Xj ) . 
w . 
J=l 

Proposition 7.4. Suppose that the following assumptions hold: 

(AZ1) The sequence Zj, j = 1, . . . ,n, is strictly stationary and is a-mixing 
with mixing coefficients (az(k))ken satisfying 

&z{k) < <5o exp(— a\k), k € N, 

for some q,q > and a\ > 0. 

(AZ2) T/ie r.v. Xj and Yj possess finite absolute moments of order p > 2. 

(AG1) Each function G n (u, z), n G N is Lipschitz in u with linearly grow- 
ing (in z) Lipschitz constant, that is, for any U\,U2 GS 

|G n (ni,z) - G n (/u 2 ,z)| < L n {a + b\z\)\u\ - u 2 \, 

where a, b are two nonnegative real numbers not depending on n and the 
sequence L n does not depend on u. 

(AG2) There are two sequences p, n and a n , such that 

\G n (u,z)\ </2 n , (u,z)eR 2 , 

and all the functions 

E[(\X\ 2 + \Y\ 2 )\G n (u, X)\ 2 }, E[\X\*\G n (u, X)\ 2 }, 

E[\X\ 2 \Y\ 2 \G n (u,X)\ 2 ] 

are uniformly bounded on R by a 2 . Moreover, assume that the sequences 
p n ,L n and a n fulfill 

fl n /a 2 n = 0(l), fl n /a n = 0(n 1 / 2 - s / 2 ), a 2 n = 0(n), 

L n /v n = 0(n 3/2 ), n->oo, 
for some 5 satisfying 2/p < 5 < 1 . 
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Let w be a symmetric, Lipschitz continuous, positive, monotone decreasing 
on R + function such that 

(7.3) 0<w(z) < log~ 1/2 (e + |z|), zet. 

Then there is 5' > and £q > 0, such that the inequality 



(7.4) P|log-( 1+£ )(l + ji n )^-^—\\rh k - E[m fc ]|| iooM > £ j < SrT 1 - 5 ' 

ZioWs /or any £ > £o> « n 2/ & G {lj • ■ • j 4}, some positive constant B depending 
on £ and arbitrary small e > 0. 

The proof of the proposition can be found in Belomestny (2010a). 
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